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The mechanism of electroweak symmetry breaking in 
the standard model (SM) [l|, Q predicts the existence 
of a fundamental scalar boson, referred to as the Higgs 
boson (H). Although there is strong evidence of elec- 
troweak symmetry breaking, the Higgs boson has yet 
to be observed. The SM does not predict the mass 
of the Higgs boson, m# , but the combination of preci- 
sion electroweak measurements 0], including recent top 
quark and W boson mass measurements from the Teva- 
tron [1, constrains m H < 152 GcV/c 2 at the 95% 
confidence level. Direct searches at LEP2 the Teva- 
tron and the LHC [8, 9] exclude all possible masses 
of the SM Higgs boson at the 95% confidence level or 
the 95% credibility level (C.L.), except within the ranges 
116.6 - 119.4 GcV/c 2 and 122.1 - 127 GeV/c 2 . A SM 
Higgs boson in these mass ranges would be produced in 
the y/s = 1.96 TeV pp collisions of the Tevatron and 
have a branching fraction to bb greater than 50% flOr - 
121 ] . While the most sensitive searches for the SM Higgs 
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boson at the LHC are those based on its decays into pairs 
of gauge bosons, searches based on decays into pairs of 
b quarks are the most sensitive at the Tevatron. The 
searches at the LHC in the four-lepton and diphoton fi- 
nal state offer precise measurements of the mass of the 
Higgs boson, while the results presented here provide di- 
rect information about the Higgs boson's couplings to b 
quarks and are therefore complementary to the primary 
LHC search modes. In searches for the production of a 
Higgs boson in association with a vector boson ( WH or 
ZH ) . lcptonic decays of the vector boson provide effec- 
tive discrimination between the expected signal and the 
large, uncertain hadronic backgrounds. Previous Higgs 
searches focused on these production and decay modes 
have been performed at LEP2 0] and the LHC [lJ,[Ll. 
This Letter describes the combination of the results of 
three CDF searches for a SM-like Higgs boson with a 
mass in the range 90 < to# < 150 GeV/c 2 . These 
searches are targeted at ZH -J> g + £~bb WH ivbb 
0, and WH, ZH -> $ T bb [13]. 

The CDF II detector is described in detail else- 
where [HE!. 

Calorimeter energy deposits are clustered 
into jets using a cone algorithm with an opening angle of 
AR = ^(A^) 2 + (A? ? ) 2 = 0.4 |. High-p T electron 
candidates are identified by match ing chargcd-particlc 



tracks in the tracking systems [2l|, |22j with energy de- 
posits in the electromagnetic calorimeters j23[. Muon 
candidates are identified by matching tracks with muon- 



detcctor track segments [241 ], The hermeticity of the 



calorimeter allows for good reconstruction of the missing 
transverse energy (^) [ll]- Jets are identified as consis- 
tent with the fragmentation of a 6 quark (6-tagged) using 
three different algorithms described in Ref. [261 ]. which 
make use of track impact parameters, the presence of 
identified displaced vertices, the presence of leptons near 
the jet, and jet kinematic properties. The average tag 
efficiency for a jet originating (not originating) from b 
quark fragmentation is in the range 42-70% (0.9-8.9%), 
depending on the properties of the jet. 

Higgs boson signal events are simulated using 
PYTHIA [27j, with CTEQ5L [28| parton distribution func- 
tions (PDFs) at leading order (LO). We normalize our 
Higgs boson signal-rate predictions to the most recent 
higher-order calculations available. The WH and ZH 
cross section calculations are performed at next-to-next- 
to leading order (NNLO) precision in QCD and next- 
to-leading-order (NLO) precision in the electroweak cor- 
rections and are described in Ref. The branching 
fractions for the Higgs boson decays are obtained from 
Ref. [Hf. These rely on calculations using HDECAY (29j 
and PROPHECY4F [5(J. Assuming the m H = 125 GeV/c 2 
hypothesis, we expect approximately 85 Higgs boson 
events to pass our selections. We model SM and instru- 
mental background processes using a mixture of Monte 
Carlo (MC) and data-driven methods. Diboson ( WW , 
WZ , ZZ ) MC samples are normalized using the NLO 



calculations from MCFM [31] . For tt we use a produc- 
tion cross section of 7.04 ± 0.7 pb I32J, which is based 
on a top-quark mass of 173 GcV/c 2 0| and MSTW 2008 
NNLO PDFs [3^] • The singlc-top-quark production cross 
section is taken to be 3.15 ± 0.31 pb [34j. The normal- 
ization of the Z+\ ets and VF+jets MC samples is taken 
from ALPGEN 35j corrected for NLO effects, except in 
the case of the WH — > Ivbb search. The normalization 
of the VF+jets MC sample in the WH — > Ivbb search, 
and normalization of the instrumental and QCD multi- 
jet samples in all searches, are constrained from data 
samples where the expected signal is several orders of 
magnitude smaller than in the search samples. 

All searches use the same data sample, which corre- 
sponds to 9.45 fb _1 of integrated luminosity [3(|. The 
analysis channels select non-overlapping subsets of the 
data. Exactly two, one, or zero charged leptons are 
required by the ZH -> i + l~bb, WH -> Ivbb, and 
WH, ZH — > $ T bb event selections, respectively, where 
I denotes a reconstructed electron or muon. Both the 
WH — s- ivbb and WH, ZH — > $ T bb event selections re- 
quire large $ T to be consistent with the signature of one 
or more high-p^ neutrinos escaping the detector. Events 
in all searches are required to contain exactly two or 
three reconstructed jets. To optimize the sensitivity, the 
data in each search are further divided into independent 
sub-channels composed of differing jet multiplicity, lep- 
ton quality, 6-tag multiplicity, and 6-tag quality. There 
are 16, 26, and 3 sub-channels for the ZH — > £ + £~bb, 
WH -t ivbb, and WH, ZH -» $ T bb analyses, respec- 
tively, totaling to 45 for the combination presented here. 
For a pair of jets, the dijet mass resolution for signal 
events at CDF is expected to be 10-15% of their mean 
reconstructed mass [371 ]. The decay width of the Higgs 
boson signal is predicted to be much smaller than this 
mass resolution. The presence of a signal would appear 
as a broad enhancement in the invariant mass distribu- 
tion of jets. The dijet mass provides the greatest dis- 
crimination between signal and background. However, to 
enhance sensitivity the dijet mass is combined with other 
kinematic information into multivariate discriminants. 

Each search sub-channel uses a multivariate analysis 
(MVA) technique designed to separate the Higgs boson 
signal from the backgrounds. The MVA functions are 
optimized separately for each subchannel and for 13 in- 
dependent mass hypotheses at each value of m# in the 
range 90 - 150 GeV/c 2 , in 5 GeV/c 2 intervals. We inter- 
pret the results using a Bayesian technique, separately 
at each value of m#, using a combined likelihood formed 
from a product of likelihoods for the individual channels, 
each of which is a product over histogram bins of the 
MVA outputs, 



N C N bi , 
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In this expression, the first product is over the number of 
channels (Nq), and the second product is over histogram 
bins containing n,-j events, binned in ranges of the final 
discriminant variables used for the individual analyses. 
The predictions for the bin contents are //jj = Rx Sij(9) + 
bij(9) for channel i and histogram bin j, where Sij and 
bij represent the expected SM signal and background in 
the bin, and R is a scaling factor applied to the signal. 
By scaling all signal contributions by the same factor, 
we assume that the relative contributions of the different 
processes are as given by the SM. 

Systematic uncertainties are parametrized by the de- 
pendence of and on 8. Each of the n sys components 
of 8, 8k, corresponds to a single independent source of 
systematic uncertainty, and each parameter may have an 
impact on several sources of signal and background in dif- 
ferent channels, thus accounting for correlations. Gaus- 
sian priors are assumed for the 9k, truncated so that no 
prediction is negative. The likelihood function, multi- 
plied by the 8k priors, Tr(8k), is then integrated over 8k 
including correlations [381 ] . 

£'{R) = J £(R,s,b\nJ)TT(8)d8. (2) 

We assume a uniform prior in R to obtain its poste- 
rior distribution. The observed 95% C.L. upper limit on 

R, R$? s , satisfies 0.95 = f£™ C'{1l)d1l. The expected 
distribution of Rg§ is computed in an ensemble of pseudo- 
experiments generated without signal. In each pseudoex- 
pcrimcnt, random values of the nuisance parameters are 
drawn from their priors. The median expected value of 
i?95 in this ensemble is denoted i?g5 P - A combined mea- 
surement of the cross section for Higgs boson production 
assuming SM branching ratios in units of the SM pro- 
duction rates is given by i? fit , which is the value of R 
that maximizes CJ . The 68% C.L. interval (one standard 
deviation) is quoted as the shortest interval containing 
68% of the integral of the posterior. 

Though many sources of systematic uncertainty dif- 
fer among the analyses, all correlations are taken into 
account in the combined limits, cross sections, and p- 
values. The uncertainties on the signal production cross 
sections are estimated from the factorization and renor- 
malization scale variations, which includes the impact of 
uncalculated higher-order corrections, uncertainties due 
to PDFs, and the dependence on the strong coupling 
constant, (a s ). The resulting uncertainties on the in- 
clusive WH and ZH production rates are 5% [l3]. We 
assign uncertainties to the Higgs boson decay branch- 
ing ratios as calculated in Ref. [39|. These uncertain- 
ties arise from imperfect knowledge of the mass of the b 
and c quarks, a s , and theoretical uncertainties in the bb 
decay rates. The largest sources of uncertainty on the 
dominant backgrounds in the 6-tagged channels are the 
rates of y+heavy flavor jets, where V = W or Z, which 



are typically 30% of the predicted values. The poste- 
rior uncertainties on these rates are typically 8% or less. 
Because the different analyses use different methods to 
obtain the y+heavy flavor predictions, we treat their un- 
certainties as uncorrelated between the Ivbb, the $ T bb, 
and l + i~bb channels. We use simulated events to study 
the impact of the jet energy scale uncertainty [20] on the 
rates and shapes of the signal and background expecta- 
tions. We observe that the jet energy scale uncertainty 
is highly constrained by the data in the individual chan- 
nels. Because differences between channels in the event 
selection and modeling of the background shapes affect 
the constraint on the jet energy scale obtained from the 
fit, we conservatively choose to treat the jet energy scale 
variations uncorrelated between the three analyses in the 
combined search. 

Uncertainties on lepton identification and trigger effi- 
ciencies range from 2% to 6% and are applied to both 
signal- and MC-based background predictions. The un- 
certainty on the integrated luminosity of 6% arises from 
uncertainties in the luminosity monitor acceptance and 
the inelastic pp cross section [40( , and is assumed to be 
correlated between the signal- and MC-based background 
predictions. 

To validate our background modeling and search meth- 
ods, we additionally perform a search for SM diboson pro- 
duction in the same final states used for the SM H — >• bb 
searches. The NLO SM cross section for VZ times the 
branching fraction of Z — > bb is 682 ± 50 fb, which is com- 
parable to the 410 ± 20 fb cross section times branching 
fraction of VH(H -> bb) for a 100 GeV/c 2 SM Higgs bo- 
son. The data sample, reconstruction, background mod- 
els, uncertainties, and subchannel divisions are identical 
to those of the SM Higgs boson search, but the discrim- 
inant functions are trained specifically for the signal of 
SM diboson production. The measured cross section for 
VZ is 4.1 ± 1.3 pb (stat+syst), which is consistent with 
the SM prediction of 4.4 ± 0.3 pb and corresponds to a 
diboson signal significance of ~3.2 standard deviations. 

To better visualize the data, we combine the his- 
tograms of the final discriminants, adding the contents of 
bins with similar signal-to-background ratio {s/b). Fig- 
ure [T] shows the signal expectation and the data with 
the background subtracted, as a function of the s/b of 
the collected bins, for the diboson analysis described 
above and for the combined Higgs boson search, assum- 
ing m>H = 125 GeV/c 2 . The background model has been 
fit to the data, and the uncertainties on the background 
are those after the nuisance parameters have been con- 
strained in the fit. An excess of Higgs boson candidate 
events in the highest s/b bins relative to the background- 
only expectation is observed in Fig. [T] 

We extract limits on SM Higgs boson production in 
the run range of 90-150 GeV/c 2 . We present our re- 
sults in terms of i?g5 S , the ratio of the limits obtained 
to the rate predicted by the SM, as a function of the 
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FIG. 1: Background-subtracted distributions for the dis- 
criminant histograms, summed for bins with similar signal- 
to-background ratio (s/b), for the diboson search (top) and 
the H -> bb (m H = 125 GeV/c 2 ) search (bottom). The 
background has been fit to the data, and the uncertainty 
on the background is the post-fit uncertainty. The signal 
model, which is normalized to the SM expectation, is shown 
with a filled histogram. The uncertainty on the background- 
subtracted data points shown is the square root of the post-fit 
background prediction in each bin. The leftmost bin contains 
all events of lower s/b values. 



Higgs boson mass. We assume the SM ratio for WH 
and ZH production. A value of Rg§ s less than or equal 
to one indicates a SM Higgs boson mass that is ex- 
cluded at the 95% C.L. These limits are shown, together 
with the median expected values and distributions of 
individual experiments assuming a signal is absent in 
Fig. [2] We also compute the best-fit rate parameter 
i? fit , which, when multiplied by the SM prediction for 
the associated production cross section times the decay 
branching ratio (<twh + &zh) x B(H — > bb), yields the 
best fit values for this product. We show our fitted 
[pwn + &zh) x B(H — > bb) as a function of ran , along 
with the SM prediction, in Fig. [3] 

Significances of excesses in data over the back- 
ground prediction are computed by calculating the local 
background-only p-value using i? fit as the test statistic. 
This p-value is the probability that i? fit is equal to or 
exceeds its observed value, assuming a signal is truly ab- 
sent. The Look-Elsewhere Effect (LEE) [4l|,|42[ accounts 
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FIG. 2: Observed and expected 95% C.L. upper limits on SM 
Higgs boson production (Rgs) as a function of Higgs boson 
mass. The shaded bands indicate the credibility bands in 
which i?95 is expected to fluctuate, in the absence of signal. 
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FIG. 3: The best-fit cross section times branching ratio 
(pwH + ozh) x B(H — > bb) as a function of tuh- The dark- 
shaded region shows the one standard deviation C.L. band, 
the light-shaded region shows the two standard deviation C.L. 
region, and the SM prediction is shown as the smooth, falling 
curve with a narrow band indicating the theoretical uncer- 
tainty. 



for the possibility of a background fluctuation affecting 
the local p-value anywhere in the tested mji range, here 
taken to be from 115 to 150 GeV/c 2 , owing to the prior 
exclusion [f|. In this mass range, the reconstructed mass 
resolution is approximately 15-20 GeV/c 2 . We therefore 
estimate that two independent outcomes are possible in 
these searches (LEE factor «2). The p- value is computed 
for each mn in the range 90-150 GeV/c 2 , and is shown 
in Fig. SJ Also shown are the expected values of the p- 
value assuming a SM signal is present, testing each value 
of mjj in turn. The maximum local significance corre- 
sponds to 2.7 standard deviations at tuh = 135 GeV/c 2 . 
Correcting for the LEE yields a global significance of 2.5 
standard deviations. 

In summary, we present a combination of CDF searches 
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FIG. 4: Background-only p-value for the combined search. 
Also shown are the median expected values and the one and 
two standard deviation C.L. bands assuming a SM signal is 
present, evaluated at each mn separately. 



for the SM Higgs boson decaying to bb pairs using the 
entire Run II data sample. We search for a Higgs boson 
with a mass between 90 and 150 GeV/c 2 , and exclude 
Higgs bosons with masses smaller than 96 GeV/c 2 . The 
observed credibility limits are higher than those expected 
in the background-only hypothesis in the mass range 115— 
150 GcV/c 2 . Within the currently non-excluded mass 
ranges, the lowest local p-value is found for a Higgs boson 
mass of 125 GeV/c 2 , where the local significance of this 
deviation with respect to the background-only hypothesis 
is 2.7 standard deviations. At the same mass hypothesis, 
we measure an associated production cross section times 
the decay branching ratio of (ctwh + &zh) x B(H — > 
bb) = 291±iif (stat + sys) fb. 

This result is of fundamental interest both because sim- 
ilar searches are difficult at the LHC and because verifi- 
cation of a Higgs-boson-like particle decaying to b quarks 
would offer a measurement of the 6-quark Yukawa cou- 
pling, further establishing the mechanism of electroweak 
symmetry breaking as the source of fcrmionic mass in the 
quark sector. 
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